Assessing Bayesian Semi‐Parametric Log‐Linear Models: An Application to Disclosure Risk Estimation
نویسندگان
چکیده
We propose a method for identifying models with good predictive performance in the family of Bayesian log-linear mixed Dirichlet process random effects count data. Their wide applicability makes assessment model crucial many fields, including disclosure risk estimation, which is focus present work. Rather than assessing on whole contingency table, we target specific objective analysis and two-stage selection procedure aimed at limiting form bias arising selection. Our proposal combines two different criteria: first stage, path search space identified through strongly penalized log-likelihood; second, small number semi-parametric evaluated context-dependent score-based information criterion. Tested variety tables, our proves to be able identify few steps, even presence large tables sampling structural zeros. carefully discuss proposed context literature contextualize illustrative application recent debate statistical limitation. Finally, provide examples further applications research areas.
منابع مشابه
Bayesian semiparametric copula estimation with application to psychiatric genetics.
This paper proposes a semiparametric methodology for modeling multivariate and conditional distributions. We first build a multivariate distribution whose dependence structure is induced by a Gaussian copula and whose marginal distributions are estimated nonparametrically via mixtures of B-spline densities. The conditional distribution of a given variable is obtained in closed form from this mu...
متن کاملModel Selection for Semiparametric Bayesian Models with Application to Overdispersion
In analyzing complicated data, we are often unwilling or not confident to impose a parametric model for the data-generating structure. One important example is data analysis for proportional or count data with overdispersion. The obvious advantage of assuming full parametric models is that one can resort to likelihood analyses, for instance, to use AIC or BIC to choose the most appropriate regr...
متن کاملBayesian Semiparametric GARCH Models
This paper aims to investigate a Bayesian sampling approach to parameter estimation in the semiparametric GARCH model with an unknown conditional error density, which we approximate by a mixture of Gaussian densities centered at individual errors and scaled by a common standard deviation. This mixture density has the form of a kernel density estimator of the errors with its bandwidth being the ...
متن کاملBayesian estimation of semiparametric duration models with spatial dependence
Work subject to heavy progress, please do not distribute. Acknowledgements: We would like to thank François Fontaine for usefull discussions.
متن کاملReleasing Microdata: Disclosure Risk Estimation, Data Masking and Assessing Utility
Statistical agencies release sample microdata from social surveys under different modes of access ranging from Public Use Files (PUF) in the form of tables or highly perturbed datasets to Microdata Under Contract (MUC) for researchers and licensed institutions where levels of protection are less severe. In addition, statistical agencies often have on-site datalabs where registered researchers c...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: International Statistical Review
سال: 2021
ISSN: ['0306-7734', '1751-5823']
DOI: https://doi.org/10.1111/insr.12471